Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
ki | 18175 | 265 | 5 | 53.0000 |
close | 1828 | 43 | 1 | 43.0000 |
Tudi | 846 | 35 | 1 | 35.0000 |
In | 1641 | 67 | 2 | 33.5000 |
Še | 527 | 27 | 1 | 27.0000 |
Po | 1402 | 102 | 4 | 25.5000 |
Ker | 717 | 25 | 1 | 25.0000 |
ampak | 1318 | 68 | 3 | 22.6667 |
Je | 583 | 20 | 1 | 20.0000 |
Lahko | 369 | 20 | 1 | 20.0000 |
kjer | 1645 | 40 | 2 | 20.0000 |
Kot | 654 | 39 | 2 | 19.5000 |
Na | 2350 | 153 | 8 | 19.1250 |
Od | 349 | 19 | 1 | 19.0000 |
Se | 344 | 18 | 1 | 18.0000 |
Pri | 1047 | 36 | 2 | 18.0000 |
Seveda | 371 | 15 | 1 | 15.0000 |
V | 5355 | 302 | 21 | 14.3810 |
Z | 810 | 43 | 3 | 14.3333 |
saj | 2116 | 56 | 4 | 14.0000 |

Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
Republike | 398 | 1 | 23 | 0.0435 |
najprej | 364 | 1 | 22 | 0.0455 |
storitev | 692 | 1 | 16 | 0.0625 |
vrste | 207 | 1 | 15 | 0.0667 |
dr | 352 | 1 | 13 | 0.0769 |
Zakona | 128 | 1 | 9 | 0.1111 |
mislim | 397 | 2 | 17 | 0.1176 |
Oct | 1639 | 1 | 8 | 0.1250 |
zopet | 187 | 1 | 8 | 0.1250 |
nekoč | 120 | 1 | 8 | 0.1250 |
radi | 215 | 2 | 16 | 0.1250 |
dnevnega | 95 | 1 | 8 | 0.1250 |
največjih | 79 | 1 | 8 | 0.1250 |
2009 | 446 | 6 | 47 | 0.1277 |
delali | 69 | 1 | 7 | 0.1429 |
predstavitev | 103 | 1 | 7 | 0.1429 |
rečeno | 84 | 1 | 7 | 0.1429 |
oceni | 80 | 1 | 7 | 0.1429 |
količino | 79 | 1 | 7 | 0.1429 |
sledi | 148 | 1 | 7 | 0.1429 |
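
In this subsection, we compute the ratio of the number of right neighbors to the number of left neighbors for each word. Again, we look for words with extreme ratios: the first table above lists the words with the highest ratios, the second those with the lowest.
Data for the first table: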
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Data for the diagram:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
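
The computation behind the two queries above can be sketched in Python as follows. This is only an illustration under stated assumptions: the function name `neighbor_ratios` and the toy `pairs` list are hypothetical, each pair `(w1, w2)` stands in for one row of `co_n` (left word, right word), and the `w1_id>100` / `w2_id>100` filters from the SQL are not reproduced.

```python
from collections import defaultdict


def neighbor_ratios(pairs):
    """Return {word: right_neighbors / left_neighbors}.

    `pairs` is an iterable of (left_word, right_word) tuples, a
    hypothetical stand-in for the rows of the co_n table.
    """
    right = defaultdict(set)  # word -> distinct words seen directly to its right
    left = defaultdict(set)   # word -> distinct words seen directly to its left
    for w1, w2 in pairs:
        right[w1].add(w2)
        left[w2].add(w1)
    # Like the join in the SQL, only words that have both right and
    # left neighbors receive a ratio.
    common = right.keys() & left.keys()
    return {w: len(right[w]) / len(left[w]) for w in common}


# Toy data (not taken from the corpus).
pairs = [("a", "b"), ("a", "c"), ("a", "d"), ("b", "a"), ("c", "b")]
ratios = neighbor_ratios(pairs)

# Highest ratios first, as in the first table.
ranked = sorted(ratios.items(), key=lambda item: item[1], reverse=True)
```

Sorting in ascending order instead would put the words with the smallest ratios first, as in the second table.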
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II